Picture for Haiyue Zhang

Haiyue Zhang

AgentIR: A Workload-Adaptive Cascade Retrieval Substrate for Long-Term Conversational Memory

Add code
May 24, 2026
Viaarxiv icon

When Simulation Lies: A Sim-to-Real Benchmark and Domain-Randomized RL Recipe for Tool-Use Agents

Add code
May 12, 2026
Viaarxiv icon

Hidden Error Awareness in Chain-of-Thought Reasoning: The Signal Is Diagnostic, Not Causal

Add code
May 10, 2026
Viaarxiv icon

OVS-DINO: Open-Vocabulary Segmentation via Structure-Aligned SAM-DINO with Language Guidance

Add code
Apr 09, 2026
Viaarxiv icon

Auditable Agents

Add code
Apr 07, 2026
Viaarxiv icon

Agent Audit: A Security Analysis System for LLM Agent Applications

Add code
Mar 24, 2026
Viaarxiv icon